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Abstract 

The problem of simultaneous multicasting of multiple messages with the help of a relay terminal is 
considered. In particular, a model is studied in which a relay station simultaneously assists two transmitters 
in multicasting their independent messages to two receivers. The relay may also have an independent message 
of its own to multicast. As a first step to address this general model, referred to as the compound multiple 
access channel with a relay (cMACr), the capacity region of the multiple access channel with a "cognitive" 
relay is characterized, including the cases of partial and rate-limited cognition. Then, achievable rate regions 
for the cMACr model are presented based on decode-and-forward (DF) and compress-and-forward (CF) 
relaying strategies. Moreover, an outer bound is derived for the special case, called the cMACr without 
cross-reception, in which each transmitter has a direct link to one of the receivers while the connection 
to the other receiver is enabled only through the relay terminal. The capacity region is characterized for a 
binary modulo additive cMACr without cross-reception, showing the optimality of binary linear block codes, 
thus highlighting the benefits of physical layer network coding and structured codes. Results are extended 
to the Gaussian channel model as well, providing achievable rate regions for DF and CF, as well as for a 
structured code design based on lattice codes. It is shown that the performance with lattice codes approaches 
the upper bound for increasing power, surpassing the rates achieved by the considered random coding-based 
techniques. 
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I. Introduction 

Consider two non-cooperating satellites each multicasting radio/TV signals to users on Earth. The cover- 
age area and the quality of the transmission is generally limited by the strength of the direct links from the 
satellites to the users. To extend coverage, to increase capacity or to improve robustness, a standard solution 
is that of introducing relay terminals, which may be other satellite stations or stronger ground stations (see 
Fig. [J). The role of the relay terminals is especially critical in scenarios in which some users lack a direct 
link from any of the satellites. Moreover, it is noted that the relays might have their own multicast traffic to 
transmit. A similar model applies in the case of non-cooperating base stations multicasting to mobile users 
in different cells: here, relay terminals located on the cell boundaries may help each base station reach users 
in the neighboring cells. 

Cooperative transmission (relaying) has been extensively studied in the case of two transmitting users, 
both for a single user with a dedicated relay terminal [1], [2] and for two cooperating users [3]. Extensions 
to scenarios with multiple users are currently under investigation [2], [5] - [11]. In this work, we aim at 
studying the impact of cooperation in the setup of Fig. Q] that consists of two source terminals simultaneously 
multicasting independent information to two receivers in the presence of a relay station. While the source 
terminals cannot directly cooperate with each other, the relay terminal is able to support both transmissions 
simultaneously to enlarge the multicast capacity region of the two transmitters. Moreover, it is assumed that 
the relay station is also interested in multicasting a local message to the two receivers (see Fig. [2]). 

The model under study is a compound multiple access channel with a relay (cMACr) and can be seen 
as an extension of several fundamental channel models, such as the multiple access channel (MAC), the 
broadcast channel (BC) and the relay channel (RC). The main goal of this work is to adapt basic transmission 
strategies known from these key scenarios to the channel at hand and to identify special cases of the more 
general model for which conclusive capacity results can be obtained. 

Below, we summarize our contributions: 

• We start our analysis by studying a simplified version of the cMACr that consists of a MAC with 
a "cognitive" relay (see Fig. [3). In this scenario the cognitive relay is assumed to be aware of 
both transmitters' messages non-causally. We provide the capacity region for this model and several 
extensions. While interesting on its own, this setup enables us to conveniently introduce the necessary 
tools to address the analysis of the cMACr. As an intermediate step between the cognitive relay model 
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Fig. 1. Illustration for an application of the compound multiple access channel with a relay. 

and the more general model of cMACr, we also consider the relay with finite capacity unidirectional 
links from the transmitters and provide the corresponding capacity region. 

• We provide achievable rate regions for the cMACr model with decode-and-forward (DF) and compress- 
and-forward (CF) relaying. In the CF scheme, the relay, instead of decoding the messages, quantizes 
and broadcasts its received signal. This corresponds to the joint source-channel coding problem of 
broadcasting a common source to two receivers, each with its own correlated side information, in a 
lossy fashion, studied in [16]. This result indicates that the pure channel coding rate regions for certain 
multi-user networks can be improved by exploiting related joint source-channel coding techniques. 

• The similarity between the underlying scenario and the classical butterfly example in network coding 
[12] is evident, despite the fact that we have multiple sources and a more complicated network with 
broadcasting constraints and multiple access interference. Yet, we can still benefit from physical layer 
coding techniques that exploit the network coding techniques. In order to highlight the possibility of 
physical layer network coding, we focus on a special cMACr in which each source's signal is received 
directly by only one of the destinations, while the other destination is reached through the relay. This 
special model is called the cMACr without cross-reception. We provide an outer bound for this setting 
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and show that it matches the DF achievable region, apart from an additional sum rate constraint at the 
relay terminal. This indicates the suboptimality of enforcing the relay to decode both messages, and 
motivates a coding scheme that exploits the network coding aspects in the physical layer. 

• Based on the observation above, we are interested in leveraging the network structure by exploiting 
"structured codes". We then focus on a modulo additive binary version of the cMACr, and characterize 
its capacity region, showing that it is achieved by binary linear block codes. In this scheme, the relay only 
decodes the binary sum of the transmitters' messages, rather than decoding each individual message. 
Since the receiver 1 (2) can decode the message of transmitter 1 (2) directly without the help of the 
relay, it is sufficient for the relay to forward only the binary sum. Similar to [17], [20], [21], this result 
highlights the importance of structured codes in achieving the capacity region of certain multi-user 
networks. 

• Finally, we extend our results to the Gaussian case, and present a comparison of the achievable rates 
and the outer bound. Additionally, we extend the structured code approach to the Gaussian channel 
setting by proposing an achievable scheme based on nested lattice codes. We show that, in the case of 
symmetric rates from the transmitters, nested lattice coding improves the achievable rate significantly 
compared to the considered random coding schemes in the moderate to high power regime. 

The cMACr of Fig. [2] can also been seen as a generalization of a number of other specific channels 
that have been studied extensively in the literature. To start with, if there is no relay terminal available, our 
model reduces to the compound multiple access channel whose capacity is characterized in [4]. Moreover, if 
there is only one source terminal, it reduces to the dedicated relay broadcast channel with a single common 
message explored in [2], [5]: Since the capacity is not known even for the simpler case of a relay channel 
[1], the capacity for the dedicated relay broadcast channel remains open as well. If we have two sources but 
a single destination, the model reduces to the multiple access relay channel model studied in [2], [25] whose 
capacity region is not known in the general case either. Furthermore, if we assume that transmitter 1 (and 
2) has an orthogonal side channel of infinite capacity to receiver 1 (2), then we can equivalently consider 
the message of transmitter 1 (2) to be known in advance at receiver 1 (2) and the corresponding channel 
model becomes equivalent to the restricted two-way relay channel studied in [6], [7], [22], and [23]. 

The cMACr model is also studied in [11], where DF and amplify-and-forward (AF) based protocols are 
analyzed. Another related problem is the interference relay channel model studied in [8], [9], [10]: Note 
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Fig. 2. A compound MAC with a relay (cMACr). 



that, even though the interference channel setup is not obtained as a special case of our model, achievable 
rate regions proposed here can serve as inner bounds for that setup as well. 

Notation: To simplify notation, we will sometimes use the shortcut: X{ S y = (xi)i<=s- We employ standard 
conventions (see, e.g., [1]), where the probability distributions are defined by the arguments, upper-case 
letters represent random variables and the corresponding lower-case letters represent realizations of the 
random variables. We will follow the convention of dropping subscripts of probability distributions if the 
arguments of the distributions are lower case versions of the corresponding random variables. The superscripts 
identify the number of samples to be included in a given vector, e.g., y{~ 1 = [y\ t \ ■ ■ • yij-i]. 

The rest of the paper is organized as follows. The system model is introduced in Section JI] In Section 
Hill we study the multiple access channel with a cognitive relay, and provide the capacity region for this 
model and several extensions. The compound multiple access channel with a relay is studied in Section ITVl 
in which inner and outer bounds are provided using decode-and-forward and compress-and-forward type 
relaying strategies. Section [V] is devoted to a special binary additive cMACr model. For this model, we 
characterize the capacity region and show that the linear binary block codes can achieve any point in the 
capacity region, while random coding based achievability schemes have suboptimal performance. In Section 
|VH we analyze Gaussian channel models for both the MAC with a relay setup and the general cMACr setup. 
We apply lattice coding/decoding for the cMACr and show that it improves the achievable symmetric rate 
value significantly, especially for the high power regime. Section IVIII concludes the paper followed by the 
appendices where we have included the details of the proofs. 
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II. System Model 

A compound multiple access channel with relay consists of three channel input alphabets X\, X 2 and 
X 3 of transmitter 1, transmitter 2 and the relay, respectively, and three channel output alphabets 3^1, 3^2 and 
3^ of receiver 1, receiver 2 and the relay, respectively. We consider a discrete memoryless time-invariant 
channel without feedback, which is characterized by the transition probability p(yi, y 2 , 2/3 ] 5 x 2, x 3 ) (see 
Fig. 0. Transmitter i has message Wi E Wi, i = 1, 2, while the relay terminal also has a message W3 G W3 
of its own, all of which need to be transmitted reliably to both receivers. Extension to a Gaussian model 
will be considered in Sec. [VI] 

Definition 2.1: A (2 nRl , 2 nR2 , 2 nR3 , n) code for the cMACr consists of three sets Wi = {1, . . . , 2 nR - } 
for i = 1,2, 3, two encoding functions /, at the transmitters, i = 1,2, 

/< : W< -» (1) 
a set of (causal) encoding functions gj at the relay, j = 1, . . . , n, 

9j ■ m x yi 1 -» Ar 3 , (2) 

and two decoding functions faj at the receivers, i = 1,2, 

fH : y? -> Wi x W 2 x W 3 . (3) 
We assume that the relay terminal is capable of full-duplex operation, i.e., it can receive and transmit at 
the same time instant. The joint distribution of the random variables factors as 

3 n 

PO{l,2,3}, ^{1,2,3}' V{1,2,3}) = Ylp( w i) ' P(^lMp(x^ \w 2 ) JJ p{x 3j |y£ -1 , W 3 )p(y {lt2 ,3}j k{l,2,3}j)- ( 4 ) 

i=l j=l 

The average probability of block error for this code is defined as 

[J {(Wxii), W 2 (i),W 3 (t)) + (W 1 ,W 2 ,W 3 )} . 

i=l,2 

Definition 2.2: A rate triplet {R\, R 2 , R3) is said to be achievable for the cMACr if there exists a 
sequence of (2 nRl , 2 nRa , 2 nR3 ,n) codes with P e n — ► as n — > cxd. 

Definition 2.3: The capacity region C for the cMACr is the closure of the set of all achievable rate 
triplets. 
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Fig. 3. MAC with a cognitive relay. 



III. MAC with a Cognitive Relay 

Before addressing the more general cMACr model, in this section we study the simpler MAC with 
a cognitive relay scenario shown in Fig. [3] This model, beside being relevant on its own, enables the 
introduction of tools and techniques of interest for the cMACr. The model differs from the cMACr in that 
the messages W\ and W 2 of the two users are assumed to be non-causally available at the relay terminal (in 
a "cognitive" fashion [13]) and there is only one receiver (y 2 = 3^3 = and y = 3l)- Hence, the encoding 
function at the relay is now defined as f 3 : Wj x W 2 x W3 — > X 3 , the discrete memory less channel is 
characterized by the conditional distribution p{y\x\,x 2 , x 3 ) and the average block error probability is defined 
accordingly for a single receiver. Several extensions of the basic model of Fig. [3] will also be considered in 
this section. The next proposition provides the capacity region for the MAC with a cognitive relay. 

Proposition 3.1: For the MAC with a cognitive relay, the capacity region is the closure of the set of all 
non-negative (R\, R 2 , R$) satisfying 

R 3 <I(X 3 ;Y\X 1 ,X 2 ,U 1 ,U 2 ,Q), (5a) 
R 1 + R 3 <I(X 1 ,X 3 ;Y\X 2 ,U 2 ,Q), (5b) 
R 2 + R 3 <I(X 2 ,X 3 ;Y\X 1 ,U 1 ,Q), (5c) 

and 

Ri + R 2 + R 3 <I(Xx,X 2 ,X 3 ;Y\Q) (5d) 
for some joint distribution of the form 

p(q)p(x 1 ,u 1 \q)p(x 2 ,u 2 \q)p(x 3 \u 1 ,u 2 ,q)p(y\xi,x 2 ,x 3 ) (6) 
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for some auxiliary random variables U\,U 2 and Q. 

Proof: A more general MAC model with three users and any combination of "common messages" 
(i.e., messages known "cognitively" to more than one user) is studied in Sec. VII of [14], from which 
Proposition 13. II can be obtained as a special case. However, since a proof is not provided in [14], and the 
technique developed here will be used in deriving other achievable regions in the paper, we provide a proof 
in Appendix |U ■ 

Towards the goal of accounting for non-ideal connections between sources and relay (as in the original 
cMACr), we next consider the cases of partial and limited-rate cognition (rigorously defined below). We 
start with the partial cognition model, in which the relay is informed of the message of only one of the two 
users, say of message W\. 

Proposition 3.2: The capacity region of the MAC with a partially cognitive relay (informed only of the 
message W\) is given by the closure of the set of all non-negative (Ri, R 2 , R 3 ) satisfying 

R 2 <I(X 2 ;Y\X 1 ,X 3 ,Q), (7a) 

R 3 <I(X 3 ;Y\X 1 ,X 2 ,Q), (7b) 

R 1 + R 3 <I(X 1 ,X 3 ;Y\X 2 ,Q), (7c) 

R 2 + R 3 <I(X 2 ,X 3 ;Y\X 1 ,Q), (7d) 

and 

Ri + R 2 + R 3 <I(X 1 ,X 2 ,X 3 ;Y\Q). (7e) 

for an input distribution of the form p(q)p(x 2 \q)p(xi, x 3 \q) . 

Proof: The proof can be found in Appendix ITT] ■ 
Remark 3.1: The capacity region characterization requires two auxiliary random variables in Proposition 
13.11 (and in [14]), while no auxiliary random variables are required in the formulation of Proposition 13.21 
This is because, in the scenario covered by Proposition 13.11 the relay's codeword can depend on both 
W\ and W 2 , and the auxiliary random variables quantify the amount of dependence on each message. 
On the contrary, for Proposition 13.21 the relay cooperates with only one source, and no auxiliary random 
variable is needed. To further elaborate on this point, another special case of the channel in Fig. [3] in 
which no auxiliary random variable is necessary to achieve the capacity region is obtained when each 
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transmitter is connected to the receiver via an orthogonal channel, i.e., we have Y = Y 2 ,Y 3 ) and 
p(yi, 2/2, V3\xi,X2, x 3 ) = T\i=i p(yi\%i) ■ In this case, unlike Proposition 13.21 the lack of auxiliary random 
variables reflects the fact that no coherent combining gain can be accrued via the use of the relay due to 
the channels' orthogonality. Defining Cj = max^ \ I(Xf,Yi), for i = 1,2,3, we obtain from Proposition 
ED that the capacity region is given by {(i?i, R 2 , R 3 ) : < i?i,0 < R 2 ,0 < R3 < C 3 ,R\ + R 2 < 
Ci + C 2 , R 2 +R 3 <C 2 + C 3 , R 1 + R 2 + R 3 <C 1 + C 2 + C 3 }. 

The model in Fig. [3]can be further generalized to a scenario with limited-capacity cognition, in which the 
sources are connected to the relay via finite-capacity orthogonal links, rather than having a priori knowledge 
of the terminals' messages. This channel can be seen as an intermediate step between the MAC with cognitive 
relay studied above and the multiple access channel with relay for which an achievable region was derived 
in [2] for the case R 3 = 0. In particular, assume that terminal 1 can communicate with the relay, prior to 
transmission, via a link of capacity C\ and that similarly terminal 2 can communicate with the relay via a 
link of capacity C 2 . The following proposition establishes the capacity of such a channel. 

Proposition 3.3: The capacity region of the MAC with a cognitive relay connected to the source terminals 
via (unidirectional) links of capacities C% and C 2 is given by 



Rt <I{X 1 ;Y\X 2 ,X 3 ,U 1 ,U 2 ,Q) + Ci, 

R2<I(Xr,Y\X 1 ,X 3 ,U 1 ,U 2 ,Q) + C 2 , 

R 3 <I{X 3 -Y\X 1 ,X 2 ,U 1 ,U 2 ,Q), 

Ri + R 2 < /(Xi,X 2 ;y|X 3 , Ut, U 2 , Q) + d + C 2 , 

I(X 1 ,X 3 ;Y\X 2 ,U 1 ,U 2 ,Q) + C 1 , 
I{X U X 3 -Y\X 2 ,U 2 ,Q) 

I{X 2 ,X 3 -Y\X U U U U 2 ,Q) + C 2 
I(X 2 ,X 3 -Y\X U U U Q) 



R\ + R 3 < min 



R 2 + R 3 < min 



(8a) 
(8b) 
(8c) 
(8d) 

(8e) 
(8f) 



and 



R\ + R 2 + R 3 < min < 



I(X 1 ,X 2 ,X 3 ;Y\U 1 ,U 2 ,Q) + C 1 + C 2 , 
I(X 1 ,X 2 ,X 3 ;Y\U 1 ,Q) + C 1 , 
I(X 1 ,X 2 ,X 3 ;Y\U 2 ,Q) + C 2 , 
I{X U X 2 ,X 3 -Y\Q) 



(8g) 
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for some auxiliary random variables U\, U2 and Q with joint distribution of the form Q. 
Proof: The proof is sketched in Appendix [TlTJ 



Remark 3.2: Based on the results of this section, we can now make a further step towards the analysis 
of the cMACr of Fig. [2] by considering the cMACr with a cognitive relay. This channel is given as 
in Fig. [2] with the only difference that the relay here is informed "for free" of the messages W\ and 
W 2 (similarly to Fig. [3) and that the signal received at the relay is non-informative, e.g., 3^2 = 0- The 
capacity of such a channel follows easily from Proposition 13.11 by taking the union over the distribution 
p(q)p(xx,ui\q)p(x 2 ,u 2 \q)p(x 3 \ui,u 2 ,q) p(yi, 2/2^1, %2, 23) of the intersection of the two rate regions ® 
evaluated for the two outputs Y\ and Y2. Notice that this capacity region depends on the channel inputs only 
through the marginal distributions p{yi\xi, X2, x 3 ) and p(y2\xi, x 2 , x 3 ). 

IV. Inner and Outer bounds on the Capacity Region of the Compound MAC with a Relay 

In this section, we focus on the general cMACr model illustrated in Fig. [2] As stated in Section [TJ single- 
letter characterization of the capacity region for this model is open even for various special cases. Our goal 
here is to provide inner and outer bounds, which are then shown to be tight in certain meaningful special 
scenarios. 

The following inner bound is obtained by the decode-and-forward (DF) strategy [1] at the relay terminal. 
The relay fully decodes both messages of both users so that we have a MAC from the transmitters to 
the relay terminal. Once the relay has decoded the messages, the transmission to the receivers takes place 
similarly to the MAC with a cognitive relay model of Section llTTl 

Proposition 4.1: For the cMACr as seen in Fig. |U any rate triplet (Ri, R2,R 3 ) with Rj > 0, j = 1, 2, 3, 
satisfying 



R 1 <I(X 1 ;Y 3 \U 1 ,X 2 ,X 3 ,Q) 



(9a) 



R 2 <I(X 2 ;Y 3 \U 2 ,X 1 ,X 3 ,Q) 



(9b) 



Rl + R 2 <I(X 1 ,X 2 ;Y 3 \U 1 ,U 2 ,X 3 ,Q), 



(9c) 



R 3 <mm{I(X 3 ;Y 1 \X 1 ,X 2 ,Ui,U2,Q), I(X 3 ; F 2 |^i,^2, Ui, U 2 , Q)} 



(9d) 



iii + R 3 < min{/(X 1 , X 3 ; Yi\X 2 , U 2 , Q), I{Xx,X 3 ; Y 2 \X 2 , U 2 , Q)} 



(9e) 



R2 + ^3 < min{/(X 2 ,X 3 ;Y 1 \X 1 ,U 1 ,Q), I{X 2 ,X 3 ;Y 2 \X 1 ,U 1 ,Q)} 



(9f) 
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and 

R 1 +R 2 + R 3 <min{I(X 1 ,X 2 ,X 3 ;Y 1 \Q), I(X 1 ,X 2 ,X 3 ;Y 2 \Q)} (9g) 
for auxiliary random variables U\, U2 and Q with a joint distribution of the form 

p{q)p{x 1 , ui I q)p{x 2 ,u 2 \q)p{x 3 \ux y u 2 ,q)p{yi,y 2 ,y 3 \xx,x 2 ,x 3 ) (10) 

is achievable by DF. 

Proof: The proof follows by combining the block-Markov transmission strategy with DF at the relay 
studied in Sec. IV-D of [2], the joint encoding used in Proposition 13. II to handle the private relay message 
and backward decoding at the receivers. Notice that conditions (l9al)-(l9cT) ensure correct decoding at the 
relay, whereas (|9dl)-([9g]) follow similarly to Proposition 13.11 and Remark 13.21 ensuring correct decoding of 



the messages at both receivers. ■ 
Next, we consider applying the compress-and-forward (CF) strategy [1] at the relay terminal. With CF, 
the relay does not decode the source message, but facilitates decoding at the destination by transmitting a 
quantized version of its received signal. In quantizing its received signal, the relay takes into consideration 
the correlated received signal at the destination terminal and applies Wyner-Ziv source compression (see [1] 
for details). In the cMACr scenario, unlike the single-user relay channel, we have two distinct destinations, 
each with different side information correlated with the relay received signal. This situation is similar to 
the problem of lossy broadcasting of a common source to two receivers with different side information 
sequences considered in [16] (and solved in some special cases), and applied to the two-way relay channel 
setup in [7]. Here, for simplicity, we consider broadcasting only a single quantized version of the relay 
received signal to both receivers. The following proposition states the corresponding achievable rate region. 

Proposition 4.2: For the cMACr of Fig. H any rate triplet (Ri,R 2 ,R 3 ) with Rj > 0, j = 1,2,3, 
satisfying 

R 1 <mm{I(X 1] Y 1 ,Y 3 \X 2 ,X 3 ,Q),I(X 1 ;Y 2 ,Y 3 \X 2 ,X 3 ,Q)}, (11) 
R 2 <min{I(X 2 -Y 2 ,Y 3 \X 1 ,X 3 ,Q),I{X 2] Y 1 ,Y 3 \X 1 ,X 3 ,Q)}, (12) 



and 



R 1 + R 2 <mm{I(X 1 ,X 2 ;Y 1 ,Y 3 \X 3 ,Q) 1 I(X 1 ,X 2 ;Y 2 ,Y 3 \X 3 ,Q)} (13) 
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such that 

R 3 +I(Y 3 ;Y 3 \X 3 ,Y 1 ,Q)<I(X 3 ;Y 1 \Q) (14) 

and 

R 3 + I(Y 3 ;Y 3 \X 3 ,Y 2 ,Q)<I(X 3 ;Y 2 \Q) (15) 
for random variables Y 3 and Q satisfying the joint distribution 

p{q,x 1 ,x 2 ,x 3 ,y 1 ,y 2 ,y 3 ,y 3 ) = p(q)p(xi\q)p(x 2 \q)p(x 3 \q)p(y 3 \y 3 ,x 3 ,q)p(y 1 ,y 2 ,y 3 \x 1 ,x 2 ,x 3 ) (16) 

is achievable with Y 3 having bounded cardinality. 

Proof: The proof can be found in Appendix [TV] ■ 

Remark 4.1: The achievable rate region given in Proposition 14.21 can be potentially improved. Instead of 
broadcasting a single quantized version of its received signal, the relay can transmit two descriptions so that 
the receiver with an overall better quality in terms of its channel from the relay and the side information 
received from its transmitter, receives a better description, and hence higher rates (see [16] and [7] for 
details). Another possible extension which we will not pursue here is to use the partial DF scheme together 
with the above CF scheme similar to the coding technique in [7]. 

We are now interested in studying the special case in which each source terminal can reach only one of 
the destination terminals directly. Assume, for example, that there is no direct connection between source 
terminal 1 and destination terminal 2, and similarly between source terminal 2 and destination terminal 1. 
In practice, this setup might model either a larger distance between the disconnected terminals, or some 
physical constraint in between the terminals blocking the connection. Obviously, in such a case, no positive 
multicasting rate can be achieved without the help of the relay, and hence, the relay is essential in providing 
coverage to multicast data to both receivers. We model this scenario by the following (symbol-by-symbol) 
Markov chain conditions: 

Y 1 -(X 1 ,X 3 )-X 2 and (17a) 
Y 2 -(X 2 ,X 3 )-X 1 , (17b) 

which state that the output at receiver 1 depends only on the inputs of transmitter 1 and the relay (117al) . 
and similarly, the output at receiver 2 depends only on the inputs of transmitter 2 and the relay dl7b| ). The 
following proposition provides an outer bound for the capacity region in such a scenario. 
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Proposition 4.3: Assuming that the Markov chain conditions ( fT71 ) hold for any channel input distribution 
satisfying (01), a rate triplet (R\ , R 2 , R 3 ) with Rj > 0, j = 1, 2, 3, is achievable only if 

Ri < I(X x ;Y 3 \U h X 2 ,X 3 ,Q), (18a) 

R 2 <I(X 2 ;Y 3 \U 2 ,X 1 ,X 3 ,Q), (18b) 

R 3 < mm{I(X 3 ;Y 1 \X 1 ,X 2 ,U 1 ,U 2 ,Q), I(X 3 ; Y 2 \X 1 ,X 2 , Ui, U 2 , Q)}, (18c) 

R 1 + R 3 <mm{I(X 1 ,X 3 ;Y 1 \U 2 ,Q), I(X 3 ;Y 2 \X 2 ,U 2 ,Q)}, (18d) 

R 2 + R 3 <mm{I(X 3 ;Y 1 \X 1 ,U 1 ,Q), I(X 2 ,X 3 ;Y 2 \U 1 ,Q)} (18e) 

and 

^1 + ^2 + ^3 <min{Jpfi,X 3 ; *i 1 <9), /(X 2 ,X 3 ;y 2 |Q)} (18f) 
for some auxiliary random variables C7i, [72 and Q satisfying the joint distribution 

p(q)p(x 1 ,u 1 \q)p(x 2 ,u 2 \q)p(x 3 \u 1 ,u 2 , q)p(yi, y 2 , y 3 \xi, x 2 , x 3 ). (19) 
Proof: The proof can be found in Appendix [V] ■ 

By imposing the condition (fTTT ) on the DF achievable rate region of Proposition 14.11 it can be easily 
seen that the only difference between the outer bound (TT8T ) and the achievable region with DF (O is that 
the latter contains the additional constraint (|9cT >. which generally reduces the rate region. The constraint (l9cT > 
accounts for the fact that the DF scheme leading to the achievable region (© prescribes both messages W\ 
and W 2 to be decoded at the relay terminal. The following remark provides two examples in which the 
DF scheme achieves the outer bound (fT8l ) and thus the capacity region. In both cases, the multiple access 
interference at the relay terminal is eliminated from the problem setup so that the condition d9cT ) does not 
limit the performance of DF. 

Remark 4.2: In addition to the Markov conditions in ( fTTT ), consider orthogonal channels from the two 
users to the relay terminal, that is, we have Y 3 = (131,132), where Y 3 k depends only on inputs X^ and X 3 
for k = 1, 2; that is, we assume X\ — {X 2 ,X 3 ) — Y 32 and X 2 — {X\,X.z) — Y 3 \ form Markov chains for any 
input distribution. Then, it is easy to see that the sum-rate constraint at the relay terminal is redundant and 
hence the outer bound in Proposition 14.31 and the achievable rate region with DF in Proposition 14.11 match, 
yielding the full capacity region for this scenario. As another example where DF is optimal, we consider 
a relay multicast channel setup, in which a single relay helps transmitter 1 to multicast its message W\ to 
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both receivers, i.e., R2 = R3 = and X2 = 0. For such a setup, under the assumption that X\ — X3 — Y2 
forms a Markov chain, the achievable rate with DF relaying in Proposition 14.11 and the above outer bound 
match. Specifically, the capacity C for this multicast relay channel is given by 

C= max min{J(Xi;y 3 |X 3 ), I{Xi,X^Y x ), I(X 3 ;Y 2 )}. (20) 

p{Xi,X 3 ) 

Notice that, apart from some special cases (like the once illustrated above), the achievable rate region with 
DF is in general suboptimal due to the requirement of decoding both the individual messages at the relay 
terminal. In fact, this requirement may be too restrictive, and simply decoding a function of the messages 
at the relay might suffice. To illustrate this point, consider the special case of the cMACr characterized by 
Xi = (Xi i, Xi 2), Yi = (Y{ 1,3^ 2) an d = Xi t i for i = 1,2 and the channel given as 

P(yi,V2, 2/3) = P{y3\xi,2,X2,2)p(yi,l\xi ; l)p(y2,l\x2,l)p{yi,2,y2,2\x3)- 

In this model, each transmitter has an error-free orthogonal channel to its receiver. By further assuming 
that these channels have enough capacity to transmit the corresponding messages reliably (i.e., message i is 
available at receiver i), the channel at hand is seen to be a form of the two-way relay channel. In this setup, 
as shown in [21], [22], [23] and [7], DF relaying is suboptimal while using a structured code achieves the 
capacity in the case of finite field additive channels and improves the achievable rate region in the case of 
Gaussian channels. In the following section, we explore a similar scenario for which the outer bound (Q3 
is the capacity region of the cMACr, which cannot be achieved by either DF or CF. 

V. Binary cMACr: Achieving Capacity Through Structured Codes 

Random coding arguments have been highly successful in proving the existence of capacity-achieving 
codes for many source and channel coding problems in multi-user information theory, such as MACs, BCs, 
RCs with degraded signals and Slepian-Wolf source coding. However, there are various multi-user scenarios 
for which the known random coding-based achievability results fail to achieve the capacity, while structured 
codes can be shown to perform optimally. The best known such example for such a setup is due to Korner 
and Marton [17], who considered encoding the modulo sum of two binary random variables. See [20] for 
more examples and references. 

Here, we consider a binary symmetric (BS) cMACr model and show that structured codes achieve its 
capacity, while the rate regions achievable with DF or CF schemes are both suboptimal. We model the BS 
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cMACr as follows: 

Y 1 = X 1 ®X 3 ®Z 1 , (21a) 

Y 2 = X 2 ®X 3 ®Z 2 , and (21b) 

Y 3 = X 1 ®X 2 ®Z 3 (21c) 

where © denotes binary addition, and the noise components Z\ are independent identically distributed (i.i.d.) 
witrQ B(e%), i = 1,2,3, and they are independent of each other and the channel inputs. Notice that this 
channel satisfies the Markov condition given in (fTTT ). We assume that the relay does not have a private 
message, i.e., R 3 = 0. The capacity region for this BS cMACr, which can be achieved by structured codes, 
is characterized in the following proposition. 

Proposition 5.1: For the binary symmetric cMACr characterized in (|2TT ). the capacity region is the union 
of all rate pairs {R\,R 2 ) satisfying 

Ri<l- H b (e 3 ), (22a) 
Ra < 1 - H b (e 3 ) and (22b) 
Rx+R 2 < min{l - H b ( £l ), 1 - H b (e 2 )}, (22c) 

where H b (e) is the binary entropy function defined as H b (e) = — eloge — (1 — e) log(l — e). 

Proof: The proof can be found in Appendix [Vl] ■ 
For comparison, the rate region achievable with the DF scheme given in (© is given by (|22l with the 
additional constraint 

R 1 + R 2 <1-H b (e 3 ), 

showing that the DF scheme achieves the capacity (1221 only if £3 < min{ei,£2}. The suboptimality of DF 
follows from the fact that the relay terminal needs to decode only the binary sum of the messages, rather 
than the individual messages sent by the source terminals. In fact, in the achievability scheme leading to 
(1221 . the binary sum is decoded at the relay and broadcast to the receivers, which can then decode both 
messages using this binary sum. 

l X ~ 13(e) denotes a Bernoulli distribution for which p(X = 1) = s and p(X = 0) = 1 — e. 
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VI. Gaussian Channels 

In this section, we focus on the Gaussian channel setup and find the Gaussian counterparts of the rate 
regions characterized in Section [III] and Section [W] We will also quantify the gap between the inner and 
outer bounds for the capacity region of the cMACr proposed in Section |lVl As done in Sec. [TTTl we first 
deal with the MAC with a cognitive relay model. 

A. Gaussian MAC with a Cognitive Relay 

We first consider the Gaussian MAC with a cognitive relay setup. The multiple access channel at time 
i, i = 1, . . . , n, is characterized by the relation 

Yi = X u + X 2i + X 3i + Z u (23) 

where Z\ is the channel noise at time i, which is i.i.d. zero-mean Gaussian with unit variance. We impose 
a separate average block power constraint on each channel input: 

1 n 

-»- v "> ( 24 ) 

i=l 

for j = 1, 2, 3. The capacity region for this Gaussian model can be characterized as follows. 

Proposition 6.1: The capacity region of the Gaussian MAC with a cognitive relay d23l with power 
constraints (l24l ) is the union of all rate triplets (R\, R 2 , R3) satisfying 

R 3 < 1 log(l + (1 - 4 - 4')P 3 ), (25a) 

Ri + R3 < \ log(l + Pi + (1 - a%)P 3 + 2yJa , 3 P 1 P 3 ), (25b) 

R2 + Rs < \ log(l + Pi + (1 - a' 3 )P 3 + 2^a' 3 'P 2 P 3 )) (25c) 

and 

i? x + R 2 + P 3 < ^ log(l + P1+P2 + P3 + 2 y /a , 3 P 1 P 3 + 2^a / 3 , P 2 P 3 ), (25d) 

where the union is taken over all parameters < a! 3 , a 3 < 1 and a 3 + a 3 ' < 1. 

Proof: The proof can be found in Appendix IVIIII ■ 
Notice that a 3 and a 3 in d25l ) represent the fraction of total power invested by the cognitive relay to help 
transmitter 1 and transmitter 2, respectively. Next, we present the capacity region for the Gaussian partially 
cognitive relay setup of (0. 
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Proposition 6.2: The capacity region of the Gaussian MAC with a partially cognitive relay (informed 
only of the message W\) is given by 

P 2 < ilog(l + P 2 ), (26a) 

P 3 < Ilog(l + (l-p 2 )P 3 ), (26b) 

Pi + Rs < \ log(l + Pi + P 3 + 2p/P^) , (26c) 

P 2 + P 3 < 7^ log(l + P 2 + (1 - p 2 )P 3 ) (26d) 

and 

Pi + R 2 + P 3 < \ log(l + Pi + P 2 + P 3 + 2/jTpP^) (26e) 

with the union taken over the parameter < p < 1. 

Proof: The result follows straightforwardly from ([7]) and the conditional maximum entropy theorem 
by defining p as the correlation coefficient between X\ and X 3 . ■ 

Notice that, the same arguments as above can also be extended to the MAC with cognition via finite- 
capacity links of Proposition 13.31 

1) Numerical Examples: For clarity of the presentation we consider P 3 = 0. In this case, it is clear 
that the choice a' 3 + a(( = 1 is optimal for (1231 ) and p = 1 is optimal in (l26b - Fig. |4] shows the capacity 
regions with full or partial cognition ( d25T ) and (1261 ). respectively) for Pi = P 2 = 3 dB and for different 
values of P 3 , namely P 3 = —6 dB and 3 dB. It can be observed from Fig. |4] that, even with a small power 
P 3 , a cognitive relay has the potential for significantly improving the achievable rate regions. Moreover, in 
the partially cognitive case, this advantage is accrued not only by the transmitter that directly benefits from 
cognition (here transmitter 1) but also by the other transmitter (transmitter 2), due to the fact that cognition 
is able to boost the achievable sum-rate (see (I26el l). 

We now consider a typical cognitive radio scenario where the two "primary" users, transmitter 1 and 
transmitter 2, transmit at rates Pi and R 2 , respectively, within the standard MAC capacity region with no 
relay (i.e., (Ri,R 2 ) satisfy (1231 ) with P 3 = and P 3 = 0) and are oblivious to the possible presence of 
a cognitive node transmitting to the same receiver. By assumption, the cognitive node can rapidly acquire 
the messages of the two active primary users (exploiting the better channel from the primary users as 
compared to the receiver) and is interested in transmitting at the maximum rate P 3 that does not affect 
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Fig. 4. Capacity regions of the Gaussian MAC with a cognitive relay with full or partial cognition fl !25t and l !26t . respectively) 
for Pi = P2 — 3 dB and for different values of P3, namely P3 = — 6 dB and 3 dB. 



the rates achievable by the primary users. In other words, the rate R3 is selected so as to maximize R% 
under the constraint that (R\, R2, R3) still belongs to the capacity region (the one characterized by d25l ) 
for full cognition and by (|26l ) for partial cognition). Fig. [5] shows such a rate i? 3 for both full and partial 
cognitive relays for P\ = P% = 3dB and two different primary rate pairs, namely R\ = R2 = 0.3 and 
R\ = i?2 = 0.55 (which is close to the sum-rate boundary as shown in Fig. [5]). It is seen that both full and 
partial cognition afford remarkable achievable rates even when the primary users select rates at the boundary 
of their allowed rates. 
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P 3 [dB] 

Fig. 5. Maximum rate R3 that does not affect the rates achievable by the primary users 1 and 2 for Pi = P2 ~ 2>dB and 
Ri = R2 = 0.3 or Ri = R2 = 0.55 (R3 is the maximum relay rate so that (Ri, R2, R3) still belongs to the capacity region ( j25l 
for full cognition and ([26} for partial cognition). 



B. Gaussian Compound MAC with a Relay 

A Gaussian cMACr satisfying the Markov conditions (ITTb is given by 

Y 1 = X 1 + 7]X 3 + Z 1 (27a) 
Y 2 = X 2 + V X 3 + Z 2 (27b) 
Y 3 = 7 (X! + X 2 ) + Z 3 , (27c) 

where 7 > is the channel gain from the users to the relay and 77 > is the channel gain from the 
relay to both receiver 1 and receiver 2. The noise components Zi, i = 1,2,3 are i.i.d. zero-mean unit 
variance Gaussian random variables. We enforce the average power constraints given in (l24l . Considering 
for simplicity the case R 3 = 0, we have the following result. 
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Proposition 6.3: The following rate region is achievable for the Gaussian cMACr characterized by d2Tb 
by using the DF strategy: 

' \ log (i + f^fi-^L 



R\ < min 



i?2 < m i n * 



1— a 2 a3 

1 log (1 + P 1 + r, 2 P 3 (l -a'D) 
|log(l + 7 2 P 2 (l-i^ 

| log (1 + P 2 +77^3(1-^)) 



(28a) 



(28b) 



and 



R\+ R.2 < min < 



i log [l + 7 2 (P! + P 2 ) [1 p^jr- 

\ log (l + Pi + r/ 2 P 3 + 2r ? V«ia 3 P i P 3 



(28c) 



_ i log (l + P 2 + r/ 2 P 3 + 2r ]y / ai a , iP 2 P3 
with the union taken over the parameters < ai,a2,a' 3 ,a' 3 ' < 1 and a 3 + a 3 < 1. Moreover, an outer 
bound to the capacity region is given by d28l) without the first sum-rate constraint in (I28cl) . 

Proof: It is enough to prove that jointly Gaussian inputs are sufficient to exhaust the DF achievable 
region (O and the outer bound (TT8T ). This can be done similarly to Proposition 16. II Then, setting the random 
variables at hand as (|741)-(|75T) (see Appendix IVIIII ) in © and (fT8l) and after some algebra the result can be 
derived. ■ 
It is noted that, similarly to (|25T ). in (|28T ) the parameters a' 3 and a' 3 ' represent the fractions of power that 
the relay uses to cooperate with transmitter 1 and 2, respectively. Moreover, the first term in each of the 
three min{-} functions correspond to the condition that the relay is able to decode the two messages, while 
the other terms refer to constraints on decoding at the two receivers. 

Next, we characterize the achievable rate region for the Gaussian setup with the CF strategy of Proposition 

Here, we assume a Gaussian quantization codebook without claiming optimality. 
Proposition 6.4: The following rate region is achievable for the Gaussian cMACr (|27T ): 



and 



2 8 V l + N a 



U s 1 1 fl , 7 2 "2P2 

P2 < - log 1 + 



1 + N a 



(29a) 



(29b) 
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where 

_ 1 + 7 2 (aiPia2-P2 + oiiPx + CK2P2) + min {aiP\, (X2P2} 

1 ~ 2d ' 

for all < 014 < 1, i = 1,2. 

1 ) Using Structured Codes: In Sec. |VJ we have shown that for a binary additive compound MAC with 
a relay, it is optimal to use structured (block linear) codes rather than conventional unstructured (random) 
codes. The reason for this performance advantage is that linear codes, when received by the relay over an 
additive channel, enable the latter to decode the sum of the original messages with no rate loss, without 
requiring joint decoding of the messages. Here, in view of the additive structure of the Gaussian channel, 
we would like to extend the considerations of Sec. [V] to the scenario at hand. For simplicity, we focus on a 
symmetric scenario where Pi = P2 = P3 = P, R\ = R2 = R (and R% = 0). Under such assumptions, the 
outer bound of Proposition IVI-BI sets the following upper bound on the equal rate R (obtained by setting 
a' 3 = «3 = 03 and o\ = 02 = a in d28V): 

R< max min { \ log ( 1 + 7 2 P ( - - 2a ° 3 ))h og (l + P + r? 2 P(l - a 3 )) , 

0<a<l 12 V VI - Q«3 I 

0<a 3 <l 

ilog(l + P(l + r ? 2 + 2r ? V^i))}, (30) 

whereas the rate achievable with DF is given by the right hand side (l30l with an additional term in min{-} 
given by 1/4 • log (l + 2j 2 P (1 — 200:3)) . The rate achievable by CF can be similarly found from ( f29T > by 
setting a\= 02 = a and maximizing over < a < 1. 

As is well known, the counterpart of binary block codes over binary additive channels in the case of 
Gaussian channels is given by lattice codes which can achieve the Gaussian channel capacity in the limit of 
infinite block lengths (see [18] for further details). A lattice is a discrete subgroup of the Euclidean space 
EL n with the vector addition operation, and hence provides us a modulo sum operation at the relay terminal 
similar to the binary case. 

For the Gaussian cMACr setting given in (l27l) . we use the same nested lattice code at both transmitters. 
Similar to the transmission structure used in the binary setting, we want the relay terminal to decode only 
the modulo sum of the messages, where the modulo operation is with respect to a coarse lattice as in [22], 
whereas the messages are mapped to a fine lattice, i.e., we use the nested lattice structure as in [18]. The 
relay terminal then broadcasts the modulo sum of the message points to both receivers. Each receiver decodes 
the message from the transmitter that it hears directly and the modulo sum of the messages from the relay 
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as explained in Appendix IVIII Using these two, each receiver can also decode the remaining message. We 
have the following rate region that can be achieved by the proposed lattice coding scheme. 

Proposition 6.5: For the symmetric Gaussian cMACr characterized by d27l ), an equal rate R can be 
achieved using a lattice encoding/decoding scheme if 

R < min {- log Q + 7 2 p) , ~ log (l + P min{l, V 2 }) , ~ log(l + P(l + r? 2 ))} . (31) 
Proof: The proof can be found in Appendix IVIII ■ 

Remark 6.1: Achievability of (|3"TT ). discussed in Appendix IVIII requires transmission at rates correspond- 
ing to symmetric rate point on the boundary of the MAC regions from each transmitter and the relay to 
the corresponding receiver. However, here, of the two senders over each MAC, one sender employs lattice 
coding (the sources), so that the standard joint typicality argument fails to prove achievability of these rate 
points. The problem is solved by noticing that, even in this scenario, it is straightforward to operate at the 
corner points of the MAC region by using single user encoding and successive decoding. Now, in general, 
two different techniques are possible to achieve any boundary rate point by using only transmission at the 
corner-point rates, namely time-sharing and rate-splitting [27]. In our case, it can be seen that time-sharing 
would generally cause a rate reduction with respect to d3"TT ), due to the constraint arising from decoding at 
the relay. On the contrary, rate-splitting does not have such a drawback: the relay terminal splits its message 
and power into two parts and acts as two virtual users, while single-user coding is applied for each virtual 
relay user as well as the message from the transmitter. Since lattice coding achieves the optimal performance 
for single user decoding, we can achieve any point on the boundary of the MAC region. 

2) Numerical examples: Consider cMACr with powers P\ = P2 = P3 = 5 dB and channel gain rj 2 = 10 
from the relay to the two terminals. Fig. [6] shows the achievable rate region and outer bound for different 
values of the channel gain from the terminals to the relay, namely 7 2 = 1 and 7 2 = 5. It can be seen that, 
if the channel to the relay is weak, then CF improves upon DF at certain parts of the rate region. However, 
as 7 2 increases, DF gets very close to the outer bound dominating the CF rate region, since the sum rate 
constraint in DF scheme becomes less restricting. 

In Fig. |7J the equal rate achievable with lattice codes (OTT ) is compared with the upper bound (l30l ) and 
the symmetric rates achievable with DF and CF for 7 2 = 1/10 and rj 2 = 10 versus P\ = P^ = P3 = P. We 
see that, for sufficiently large P, the lattice-based scheme is close to optimal, whereas for smaller P, CF or 
DF have better performance. The performance loss of lattice-based schemes with respect to the upper bound 
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0.5 1 1.5 2 

R 1 

Fig. 6. Achievable rate region and outer bound for Pi = P2 = P 3 = 5 dB, rf = 10 and different values of the channel gain 
from the terminals to the relay, namely j 2 — 1,5. 

is due to the fact that lattice encoding does not enable coherent power combining gains at the destination. 
It is also noted that both DF and lattice-based schemes have the optimal multiplexing gain of 1/2 (in terms 
of equal rate). 

VII. Conclusions 

We have considered a compound multiple access channel with a relay terminal. In this model, the relay 
terminal simultaneously assists both transmitters while multicasting its own information at the same time. 
We first have characterized the capacity region for a multiple access channel with a cognitive relay and 
related models of partially cognitive relay and cognition through finite capacity links. We then have used the 
coding technique that achieves the capacity for these models to provide an achievable rate region with DF 
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P (dB) 

Fig. 7. Equal rate achievable with lattice codes J3U compared with the upper bound d30b and the rates achievable with DF and 
CF for 7 2 = 1/10 and rf = 10 versus Pi = P 2 = P 3 = P- 

relaying in the case of a general cMACr. We have also considered a CF based relaying scheme, in which 
the relay broadcasts a compressed version of its received signal using the received signals at the receivers as 
side information. Here we have used a novel joint source-channel coding scheme to improve the achievable 
rate region of the underlying multi-user channel coding problem. 

We then have focused on another promising approach to improve rates in certain multi-user networks, 
namely using structures codes, rather than random coding schemes. We have proved that the capacity can be 
achieved by linear block codes in the case of finite field additive channels. Motivated by the gains achieved 
through such structured coding approaches, we have then analyzed the performance of nested lattice codes 
in the Gaussian setting. Our results show that lattice coding achieves rates higher than other random coding 
schemes for a wide range of power constraints. We have also presented the achievable rate regions with 
the proposed random coding schemes, and provided a comparison. Our analysis has revealed that no single 
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coding scheme dominates all the others uniformly over all channel conditions. Hence a combination of 
various random coding techniques as well as structured coding might be required to improve the achievable 
rates or to meet the upper bounds in a general multi-user network model. 

Appendix I 
Proof of Proposition s. II 

A. Types and Typical Sequences 

Here, we briefly review the notions of types and strong typicality that will be heavily used in the proofs. 
See [26] for further details. The type P x n of an re-tuple x n is the empirical distribution 

P x n = -N(a\x n ) 
n 

where N(a\x n ) is the number of occurrences of the letter a in vector x n . The set of all ra-tuples x n with 
type Q is called the type class Q and denoted by Tq. For a probability distribution px, the set of e-strongly 
typical re-tuples according to px is denoted by Tr-fa and is defined by 

1 



n 



-N{a\x n )-p x {a) 



< e, Va E X and N(a\x n ) = whenever px{x) = > . 



The definitions of type and strong typicality can be extended to joint and conditional distributions in a 
similar manner [26]. The following results concerning typical sets will be used in the sequel. For any e > 0, 
we have 

' < e (32) 



1 log\T( xh \-H(X) 



n 
and 

Pv(X n eT^ x] J>l-e (33) 

for sufficiently large n. Given a joint distribution pxy, if the i.i.d. sequences (x n ,y n ) ~ p\Py, where P x 
and Py are n-fold products of the marginals px and py, then 

Pi{(x n ,y n ) E T[ XY]< ] < 2' n( - I ( X ' Y ^\ 
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B. Converse 

Starting from the Fano inequality, imposing the condition P™ — > as n — > oo, we have 

tf^W^l^^n (34) 
with <5 n — ► as n — > oo. Then we also have H(W\, W 3 \Y n , W 2 ) < n<5 n . We can obtain 

ra(i?i+i? 3 ) =fT(Wi,W 3 ) = H(W 1 ,W 3 \W 2 ) < I (WuW 3 ;Y n \W 2 ) + n5 n (35) 

n 

= ^ J H"(y i |W r 2 ,^ 1 ) - H(Y i \W 1 ,W 2 ,W 3 ,Y i ~ 1 ) + n5 n (36) 

i=l 



= Y^H{Y l \ X 2i , W 2 , Y i ~ 1 ) - H(Yi\X u , X 2i , X 3i , W l , W 2 , W 3 , Y^ 1 ) + n,5 n (37) 
i=l 

n 

= H{Yi\ X 2i , C7 2i ,y i_1 ) - H(Yi\ X U , , X 3i , C/ H , U 2i ) + n5 n (38) 
i=l 

n 

< ^ (K^, U 2i ) - H(Yi\X u ,X 2i ,X 3i , U u , U 2i ) + n6 n (39) 
i=i 

n 

= ^ J(X U , C/ii , X 3t ; | X 2l ,U 2i ) + n8 n (40) 



i=i 



= Y, I (Xii,X 3l] Y l \X 2u U 2l ) + n5 n (41) 
i=i 

where in (1371 ) we have used the fact that the codewords are function of the messages, in d38l ) we have 
defined Uu — W\ and C/ 2 j = W2 and used the fact that Y 1 ^ 1 — {Xu, X 2 i, X 3 i) — Yi forms a Markov chain, 
and the last equality follows from the Markov chain relationship (Uu,U 2 i) — (Xu, X 2 i, X 3 i) — Y^. 
We can similarly obtain 

n 

n{R 2 + R 3 )<Y J I{X 2l ,X 3i -Yi\X u , Uu) + n5 n 
i=l 

starting from n(R 2 + R 3 ) < I(W 2 , W 3 ; Y n \ W\) + n5 n (which follows from the Fano inequality (|34l ) since 
it implies H(W 2 , W 3 \Y n , W x ) < n5 n ) and 

n 

nR 3 < ^ I{X 3i ;Yi\Xu, X 2i ,Uu,U 2i ) + n5 n 
i=l 
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from the inequality nR 3 < I(W 3 ;Y n \W 1 ,W 2 ) + nd n (which follows from @D as H(W 3 \Y n ,Wi,W 2 ) < 
nS n ). From (l34l) . we also have: 

n(R 1 + R 2 + R 3 )<I(W 1 ,W 2 ,W 3 ;Y n )+n5 n 

n 

= Y,H(Y i \Y l - 1 )-H{Y i \W 1 ,W 2 ,W 3 ,Y i ~ 1 )+n5 n 

i=l 
n 

= Y,H{Y i \Y i - 1 )-H(Y i \X li ,X 2U X 3i ,W l ,W 2 ,W 3 ,Y i - 1 ) + n5 n 
i=i 

n 

< H(Y) - H{Yi\X u ,X 2i ,X 3i ) + n5 n = 
i=i 

n 

= I{Xu,X 2i ,X 3l - Y) + nb n . 
-i=i 

Now, introducing the time-sharing random variable Q independent from everything else and uniformly 
distributed over {1, .., n} and defining Xj = Xjq for j = 1, 2, 3, Y = Yq and Uj = UjQ for j = 1, 2, we 
get (f5]). Notice that the joint distribution satisfies ©. 

C. Achievability 

Code Construction: Generate an i.i.d. sequence Q n with marginal p{q) for i = 1, 2, n. Fix a realization 
of such a sequence Q n = q n . Generate 2 nR] codewords Uj(wj), Wj = 1, 2, 2 nRj also i.i.d. with probability 
distribution Yi7=i P( u ji\li)^ f° r 3 = 1, 2. For each pair w\,w 2 , generate 2 nR3 codewords i.i.d. according 
to Yli=iP( x 3i\ u ii( w i)i u 2i{w 2 ), qi), and label these codewords as x^(wi,w 2 ,w 3 ) for w 3 6 [l,2 ni?3 ]. Also 
generate 2 nR] codewords x"j(wj), j = 1,2, i.i.d. with probability distribution Yli=iP(. x ji\ u ji( w j)) < li) an( ^ 
label them as xj(wj) for Wj G [l,2 nRj ]. 

Encoders: Given (1^1,^2,^3), encoder j transmits x^{wj),j = 1,2, and encoder 3 transmits 23 (wi, 102, w 3 ). 

Decoders: The decoder looks for a triplet (tSi, ^2, w 3 ) such that 

( 9 n , < (t«i ) , (U» 2 ) , 2? (t&i ) , 2# (tfo ) , X^ (wi,W 2 ,W 3 ),y n ) G T[™ ^ ^ j^jfcx.y] . • 

If none or more than one such triplet is found, an error is declared. 

Error analysis: Assume (wi,w 2 ,w 3 ) = (1,1,1) was sent. We have an error if either the correct 
codewords are not typical with the received sequence or there is an incorrect triplet of messages whose 
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corresponding codewords are typical with the received sequence. Define the event (conditioned on the 
transmission of (w\,W2, W3) = (1, 1, 1)) 

E k , l>m = {(Q n ,Um^nk),U^l),X^l),X^k,l,m),Y n ) £ T? QUlUaX ^ Xa y].}- 
From the union bound, the probability of error, averaged over the random codebooks, is found as 

P e n = Pr(^ (1>1 ) \J U {k>ljmMltl>1) E kt i !m , 

<Pr(£ 1 c jl)1 ) + Yl Pr(E kXm )+ Yl Pr(^ Um )+ J2 Pr(£ fc ,*, m ). 

(fe,m)^(l,l),i=l (i,m)^(l,l),fc=l k^l,tyl,m£l 

From (I33T ). Pr(E , J x x ) — > as n — > 00. We can also show that for (k, m) ^ (1, 1), I = 1, 

< 2-n(/(X 1 ,C/i,X 3 ;y|X 2 ,C/ 2 ,Q)-3e) 
= 2-n(/(X 1 ,X 3 ;r|X 2 ,C/ 2 ,Q)-3e) 

Similarly, for (Z, m) 7^ (1, 1) and k = 1, we have 

P(£ M , m ) =Pr((9 n ,«?(l),«J(Z),x?(l),x5(0,xJ(l,Z,m), 1 / n ) £ T%j UlUaXlXaXaYl .) 

< 2~n(I(X 2 ,X 3 ;Y\X 1 ,U 1 ,Q)-3e) 

The third error event occurs for k ^ 1, 1 ^ 1, m ^ 1, and we have if 

P(£ M , m ) =Pr((?«, «?(*;), t^(0, xf(fc),a;5(0,xJ(fc,Z,m), 1 / n ) £ Iftt^x^y].) 

< 2-n(/(X 1 ,X 2 ,X 3 ;Y|Q)-4e) 
Then, it follows that 

P n <Vt{E{ 1 x ) + 2 n( - Rl+R ^2^ n( ' I( - XuX3 ' Y ^ X2 ' U2,Q ^~ 3 ^ + 2 n( - R2+R ^2~ n ( I( - X2,X3 ' Y \ XuUl ' Q ^~ 3 ^ 

Letting e — ► and n — > 00, we have a vanishing error probability given that the inequalities in © are 
satisfied. 
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Appendix II 
Proof of Proposition [JTJ 

A. Converse 

Similar to the converse in Appendix HI we use the Fano inequality given in (f34b - Then we have 

n(R 1 +R 3 ) = H(W 1 ,W 3 ) = H(W 1 ,W 3 \W 2 )<I(W 1 ,W 3 -Y n \W 2 )+n5 n (42) 

n 

= J2H(Y i \W 2 ,Y i ~ 1 ) - H(Y i \W 1 ,W 2 ,W 3 ,Y i - 1 ) + n5 n (43) 
i=l 

n 

= J2 H ( Y i\ x 2i ^Y'- 1 )- H(Yi\X u , X 2i , X 3i , W l , W 2 , W 3 , r- 1 ) + n5 n (44) 
i=i 

n 

< J2 H ( Y i\ X 2i) - H(Y i \X li ,X 2i ,X 3i ) + n5 n (45) 
i=i 

n 

= /(Xii , X 3i ; y< I X 2l ) + n<5 n (46) 
i=i 

where (I43T ) follows from the fact that conditioning reduces entropy and (Wi, W2, W3, — (X^, X2i, -X31)— 
li forms a Markov chain. The other inequalities follow similarly. 

B. Achievability 

Code Construction: Generate 2 nRl codewords x™{w\), w\ G [l,2 ni?1 ] by choosing each i-th letter i.i.d. 
from probability distribution p{x\), i = 1,2, ...,n. For each w\, generate 2 nRi codewords x§ (wi, W3), 
w 3 G [l,2 ni?3 ], i.i.d. according to Yi7=i P{ x 3i\ x u( w i))- Finally, generate 2 ni?2 codewords ^2(^2) i-i-d. with 
each letter drawn according to p{x 2 ). 

Encoding and error analysis are rather standard (similar to Appendix A) and are thus omitted. 

Appendix III 
Proof of Proposition [33] 

A. Converse 

The converse follows from standard arguments based on the Fano inequality (see, e.g., Appendix [J). 
Here, for illustration, we derive only the first bound in ([8]), i.e., R\ < I(Xi;Y\X 2 , X 3 ,Ui,U 2 ) + C\, as 
follows. Define as V\ G Vi and V 2 G V 2 the messages of cardinality \Vi\ < C% sent over the two links from 
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the sources to the relay. Notice that Vi is a function only of Wi and that X% is a function only of W3, Vi 
and V 2 . Considering decoding of W\, from the Fano inequality 

HiW^^MW) <nS n , 

we get 

nRi =H(Wi\ W 2 ) < I{Wi ; Y n , V h V 2 \ W 2 ) + nb n 

< /( Wi ; Vi 1 W 2 ) + I(Wx ; V 2 \ Vi , W 2 ) + /( W 1 ; Y n \ W 2 , Vi , V 2 ) + n5 n 

< nC\ + /(Wi; Y n |W 2 , Vi, F 2 ) + n5 n 

n 

= nCi + J2 HiYilY'- 1 , W 2 , Vi, V 2 ) - H(Yl\Y i ~ l , W X ,W 2 , Vi, V 2 ) 
i=i 

n 

= nC 1 + Y, H(Yi \Y { -\ X 2i , X 3l , W 2 , E/u , £/ 2i ) - (Y, | Y^ 1 , X u , X 2i , X 3l ,W X ,W 2 , U u , U 2i ) 

8=1 

n 

< nC x + £ i/Q^M, X 3i , C/i is U 2i ) - HpilX^X^Xai, U u , U 2i ) 

n 

i=i 

where in the third line we have used the facts that /(Wi; Vi|W 2 ) < iZ(Vi) < nCi and /(Wi; VijVi, W 2 ) = 
and the definitions Uu = Vi and U 2 i = V 2 . The proof is concluded similarly to Appendix U 

B. Achievability 

Code Construction: Split the message of the terminals as Wj = [Wj p Wj C ] with j = 1,2, where Wj p 
stands for the "private" message sent by each terminal without the help of the relay and Wj c for the 
"common" message conveyed to the destination with the help of the relay. The corresponding rates are 
R\ = Rip + Ri c and R 2 = R 2p + R 2c . Generate a sequence Q n i.i.d. using p(q) for i = 1,2, ...,n. 
Fix a realization of such a sequence Q n = q n . Generate 2 nRjc codewords u™(wj c ), Wj C G [l,2 ni?3C ] by 
choosing each ith letter independently with probability p(uj\qi), i = 1,2, ...,n, for j = 1,2. For each Wj c , 
generate 2 nRjp codewords x"j(wj c , Wj p ), j = 1, 2, Wj p G [1, 2 nR]P ], i.i.d. with each letter drawn according to 
p(xj\uji(wj c ), q{). Finally, for each pair w\ c , w 2c , generate 2 nRa codewords x 3 (wi c , w 2c , w 3 ), w 3 G [1, 2 nRi ], 
i.i.d. according to p(x 3 \uu(wi c ), u 2i (w 2c ), qi). 
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Encoders: Given the messages and the arbitrary rate splits at the transmitters (w\ = [w\ p wi c ],W2 = 
[w2 P W2c],W3), encoder 1 and encoder 2 send the messages w\ c and W2 C , respectively, over the finite-capacity 
channels which are then known at the relay before transmission. Terminal 1 and terminal 2 then transmit 
xJiwjcWjp), j = 1,2, and the relay transmits X3 (wi c , W2 C , w 3 ). 

The rest of the proof follows similarly to Appendix |TJ by exploiting the results in Sec. VII of [14]. 

Appendix IV 
Proof of Proposition 14.21 

We use the classical block Markov encoding for achievability, and we assume \Q\ = 1 for the sake of 
brevity of the presentation. Generalization to arbitrary finite cardinalities follows from the usual techniques 
(see, e.g., Appendix [jj). 

Codebook generation: Generate 2 nRk i.i.d. codewords xjjf from probability distribution p(x^,) = Yii=i P( x ki) 
for k = 1,2. Label each codeword, for k = 1,2, as x^(wk), where G [l,2 nRfc ]. Generate 2™(- Ro+R3 ) 
i.i.d. codewords x% from probability distribution p(x^) = JlILi P( x 3i)- Label each codeword as x^(s,w 3 ), 
where s G [1,2"^°] and w 3 G [l,2 ni?3 ]. Also, for each x 3 l (s,w 3 ), generate 2 nRo i.i.d. sequences y% from 
probability distribution p(y^ (s, w 3 )) = IliLi P(V3i\ x 3 ( s > w 3))> where we define 

p(m\ x 3) = ^2 P( x i)p( x 2)p(yi,y2,y3\xi,x 2 ,x 3 )p(y 3 \y 3 ,x 3 ). 
Xi,x2,yi,y2,y3 

We label these sequences as y 3 (m\s,w 3 ), where m G [l,2 nito ], s G [1,2 °] and w 3 G [l,2 ni?3 ]. 

Encoding: Let (wu, W2i, w 3 i) be the message to be transmitted in block i, and assume that (^3 (s* | Si— 1 , ^3,1-1), Y 3 {i— 
1), £3 (sj_i, wz^-i)) are jointly typical. Then the codewords x™(wu), X2(w2i) and x^(si,w 3 i) will be 
transmitted in block i. 

Decoding: After receiving y 3 (i), the relay finds the index Sj+i such that 

(£3X s m|si,w3,0>y3 Wi^3( s ^3;)) e T^ nX3]/ 

For large enough n, there will be such Sj+i with high probability if 

Rv>I(Y 3 ;Y 3 \X 3 ). 

WeftxR = I(Y 3 ;Y 3 \X 3 ) + e. 
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At the end of block i, the receiver k finds indices and w 3i such that 

^(stw k 3i ),y n k (i))eT^ X3Yk]e 

and 

iy^l\stiM^il^(sliM^i),yUi - 1)) e zj^^ 

are simultaneously satisfied, assuming that s^Li and w 3i _ 1 have been previously correctly estimated. Re- 
ceiver k will find the correct pair (si,w 3 i) with high probability provided that n is large enough and that 

R 3 + I(Y 3 ;Y 3 \X 3 ,Y k ) < I(X 3 ;Y k ). 

Assuming that this condition is satisfied so that = and w 3i _ 1 = w 3i _i, using both ysi s i\ s i-iJ w 3 i-i) 
and y k (i), the receiver k then declares that (w* i&2i-l) was sent 01 block i — 1 if 

We have i&2i-l) = (^l.i-ij w 2,i-i) with high probability provided that n is large enough, 

R 1 <I(X 1 ;Y k ,Y 3 \X 3 ,X 2 ), 

R 2 <I(X 2 ;Y k ,Y 3 \X 3 ,X 1 ), (47) 

and 

fli + ifc < I(X 1 ,X 2 ;Y k ,Y 3 \X 3 ), 

for it = 1,2. 

Appendix V 
Proof of Proposition 03] 



To simplify the presentation, here we prove (1181 ) for R 3 = 0. The case with R 3 > follows similarly by 
following the same reasoning as in Appendix II-BI From the Fano inequality, for i = 1,2, 

H(W 1 ,W 2 \Y^)<n5 n , 
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where 5 n — ► for n — > oo, from which it follows that H(Wi\Y™) < n5 n and H(W 2 \Y™) < n5 n . For the 
relay terminal, we have 

(Wi|W 2 , Y 3 n ) = H{W 1 \W 2 , X$, Y 3 n , X 3 n ) (48) 
<H{Wy\X2,X%) (49) 
= H(W 1 \X%,X%,Y 2 n ) (50) 

< H(W X \Y?) (51) 

< n5 n , (52) 



where (1481 ) follows since X 2 is a function of W 2 and X 3 is a function of Y 3 n ; (1501 ) follows since Y 2 — 
(X 2 ,X 3 ) — W\ form a Markov chain based on the assumption in (117ak (1511 ) follows since conditioning 
reduces entropy; and finally d52l follows from the Fano inequality. Similarly, we can also show 

H(W 2 \W 1 ,Y^)<n6 n . 

We also define the auxiliary random variables Uu = W\ and Un = W 2 , for i = 1, . . . , n. It follows that: 

n-Ri = ff(Wi) = H{W 1 \W 2 ) (53) 
</(^ i; y 3 n |Ty 2 ) + n5 n (54) 

n 

= HWi; Y 3i \w 2 , y;- 1 ) + n8 n 

1=1 

n 

= J2H(Y 3i \Yt\W 2 ) - HiY^Wx^Yt^ + ndn 
i=i 

n 

= J2 H ( Y 3i\Yt\W 2 ,X 2i ,X 3i )-H{Y 3i \W 1 ,W 2 ,Yt 1 ,X li ,X 2i ,X 3i ) + n8 n (55) 

i=l 
n 

< Y,H{Y 3i \W 2 ,X 2h X 3 {)-H{Y 3i \X lh X 2i ,X 3i ) + n8 n (56) 
i=i 

n 

< H ( Y 3i I U 2i , X 2i ,X 3i )- H(Y 3i I U 2l , X xi , X 2l ,X 3l ) + nb n (57) 
i=i 

n 

= I(Xli ; Y 3i I U 2l , X 2l , X 3t ) + nb n , (58) 
i=l 

where d54l follows from (|52l ); (|55l ) follows as Xh and X 2 { are functions of Wj. and W 2 , respectively, and 
X 3 i is a function of Y 3 ; d56l ) follows from the fact that conditioning reduces entropy and also the fact 
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that Y32 — (Xn, X 2 i, Xsi) — (Wi, W 2 ,Y 3 l ); and again ( 1571 ) follows from the fact that conditioning reduces 
entropy and the definition of auxiliary random variable Un- Similarly, we can show that 

n 

nR 2 < Y,I(X 2i -Y Zi \U lu X u ,X M ). (59) 
i=i 

Next, we consider the bounds due to decoding at receivers. Focusing on the first message and the first 
receiver, we have 

nRi = H{W X \W 2 ) < I(Wr,Y?\W 2 ) + n5 n 

n 

= H(Y U \W 2 , Yt 1 ) - HiYulWx, W 2 , Y^ 1 ) + nb n 
i=i 

n 

= Y J H(Y li \W 2 ,Yt 1 ) ~ HiYulW^W^X^X^X^Yl'^ + nSn (60) 
i=i 

< ^ H(Y U \W 2 ) - H(Y U \X U , X 2l ,X 3t ) + n5 n (61) 
i=i 

n 

= Y,H(Y li \W 2 )-H(Y li \X li ,X 3i ) + n5 n (62) 
i=l 
11 

< Y,H(Y u \U 2i ) - H{Y ll \X li ,X 3u U 2l )+n5 n (63) 
i=l 
n 

= ]T I(X U , X 3i ■ Y u \U 2i ) + n5 n (64) 

i=l 

where (l60l follows since Jfij and X2i are functions of W\ and W2, respectively, and X 3 % is a function 
of Y£~ , (|6Tb follows as conditioning reduces entropy and — (JTxi>-^2i>-^3i) — (Wi,W 2 ,Y^~ 1 ) forms 
a Markov chain; (l62l) follows since Y~ij — (Xij,X3j) — X2i forms a Markov chain; and finally in (l63l) we 
simply used the definition of U 2 t and the fact that conditioning reduces entropy. We can similarly obtain 

n 

nR 2 < J2l{X 2i ,X 3i ;Y 2i \Uu) + nd n . 

i=l 
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We now focus on the first message and the second receiver. We have 

n.Ri = H{Wi) 

<I(W i; Y 2 n \W 2 )+n5 n 

n 

= H(Y 2i \W 2 , Yf 1 ) - H{Y 2i \W 1 , W 2 , Yt 1 ) + n5 n 



i=i 

n 



= H(Y 2i \W 2 , Yt\X 2i ) - H(Y 2t \W 1: W 2 , X u ,X 2i , X 3i , KT 1 ) + nS n (65) 
i=i 

n 

< Y,H(Y u \W 2 ,X 2t ) - H(Y 2i \X u ,X 2i ,X 3i )+nS n (66) 
i=i 

n 

= H(Y 2i \W 2 ,X 2t ) - H(Y 2l \X 2t , X 3i ) + nS n (67) 
i=i 

n 

< Y H(Y 2i \U 2t , X 2i ) - H{Y 2i \X 2i ,X 3i , U 2i ) + nS n (68) 



i=i 

n 



= 1{X 3t] Y 2t \U 2l ,X 2i ) + nS n . (69) 

i=l 

We can similarly obtain 

n 

nR 2 < YjI{Xzi,Y u \U u ,X u ) + nS n . 

i=i 

We also have 

n(Ri + R 2 ) < I(W U W 2 ; Y?) + n5 n 

n 

= Y,H(Y li \Yt 1 ) - HiYulWuW^Y;- 1 ) +nS n 
i=i 

n 



= Y J H{Y lt \Yl- 1 ) - H{Y li \X li ,X 2i ,X 3i ,W 1 ,W 2 ,Yt 1 ) + n5 n 

i=l 

n 

<J2H(Y U ) - H{Y lj \X u ,X 2i ,X 3 {) + n6 n 

n 

< H(Y U ) - H(Y u \X u ,X 3l ) + nS n 



i=i 



= Y I i X ^ X 3i ;Yii) + nS n , 



i=i 
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and similarly 

n 

n(Ri + R 2 )<Y1 J ( X 2i, X 3i ; Y 2i ) + n5 n . 
i=l 

Now, introducing the time-sharing random variable Q uniformly distributed in the set {1, 2, .., n} and defining 

Xj = X jQ for j = 1, 2, 3, Yj = Y jQ and U j = U jQ for j = 1, 2, we get CE! (for R 3 = 0). 

Appendix VI 
Proof of Proposition 15. II 

We first prove the converse showing that (l22l ) serves as an outer bound, and prove the direct part describing 
a structured coding scheme that achieves the outer bound. 

To prove the converse, it is sufficient to consider the outer bound given by (fT8T ) as applied to the channel 
characterized by (1211 ). and show that an input distribution (fT9l ) with Xi, X2, X3, U\, U 2 ~ 6(1/2) and 
independent of each other maximizes all the mutual information terms. To this end, notice that in the outer 
bound (fT8T ) with R% = ignoring all the constraints involving auxiliary random variables can only enlarge 
the region, so that we have the conditions: 

R 1 <I(X 1 ;Y 3 \X 2 ,X 3 ,Q) 1 (70) 
R 2 <I{X 2 ;Y 3 \X 1 ,X 3 ,Q) (71) 

and 

R 1 + R 2 <mm{I(X 1 ,X 3 ;Y 1 \Q),I(X 2 ,X 3 ;Y 2 \Q)}. (72) 

We can further write 

I(Xi;Y 3 \X 2 ,X 3 ,Q)=H(Y 3 \X 2 ,X 3 ,Q)-H(Y 3 \X 1 ,X 2 ,X 3i Q) 
< H(Y 3 ) - H b {e 3 ) < 1 - H b (e 3 ), 

and 

I(X 1 ,X 3] Y 1 \Q) = H(Y 1 \Q)-H(y 1 \X 1 ,X 3 ,Q) 

<H{Y X )-H b {e 1 ) <l-H b ( Sl ). 

We can see that the inequalities hold with equality under the above stated input distribution, which concludes 
the proof of the converse. 



February 18, 2009 



DRAFT 



37 



We now prove the direct part of the proposition. First, consider R\ > R%. Transmission is organized into 
B blocks of size n bits. In each of the first B — 1 blocks, say the 6th, the j-th transmitter, j = 1, 2, sends 
nRj new bits, conventionally organized into a 1 x [nRj\ vector Uj &. Moreover, encoding at the transmitters 
is done using the same binary linear code characterized by a ["--RiJ x n random binary generator matrix G 
with i.i.d. entries £>(l/2). 

Specifically, as in [23], terminal 1 transmits xi & = u^G and terminal 2 transmits x 2i 6 = [0 u 2j fc]G 
where the all-zero vector is of size 1 x [ n -RiJ — L n ^2j (zero-padding). Since capacity-achieving random 
lineal - codes exist for BS channels, we assume that G is the generating matrix for such a capacity achieving 
code. 

We define 113^ = Ui^ © [0 U2&]. The relay can then decode 113,5 from the received signal y 3) & = 
U3 5G + Z3 since xi^ © X2,& is also a codeword of the code generated by G. This occurs with vanishing 
probability of error if (|22a| ) holds (see, e.g., [24]). In the following (b + l)-th block, the relay encodes 
u 3 b using an independent binary linear code with an |_ n -RiJ x n random binary generator matrix G3 as 
X3 6 + i = U3 ,&G3. We use the convention that the signal sent by the relay in the first block is x 3) i = or 
any other known sequence. 

At the end of the first block (b = 1), where the relay sends a known signal (that can be canceled by 
both receivers), the j-th receiver can decode the current nRj bits Uj i from the jth transmitter if Rj < 
1 — H}y{ej). Under this condition, we can now consider the second block, or any other (b + l)-th block, 
assuming that the j-th receiver already knows u^f,. In the (b + l)-th block, the first receiver sees the signal 
yi,6+i = u^b+iG © u 3) &G3 © zi. However, since ui^ is known at the first receiver, it can be canceled 
from the received signal, leading to y[ b+1 = ui^+iG © u 2 ^G' 3 © zi, where G 3 is a [^^2] x n matrix that 
contains the last [^^2] rows of G3. Due to the optimality of random linear codes over the BS MAC (see, 
e.g., [24]), u^b+i and U2 t b are correctly decoded by the first receiver if R\ + i?2 < 1 — il&(£i). Repeating 
this argument for the second receiver and then considering the case R\ > R2 concludes the proof. 

Appendix VII 
Proof of Proposition [63] 

We first give a brief overview of lattice codes (see [18], [22] for further details). An n-dimensional lattice 
A is defined as 

A = {GX : X G Z n }, 
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where G £ R n is the generator matrix. For any x £ H n , the quantization of X maps X to the nearest lattice 
point in Euclidean distance: 



The fundamental Voronoi region V(A) is defined as V(A) = {X : Q\{X) = 0}, whose volume is denoted 
by V(A) and is defined as V(A) = Jy/ A N dX. The second moment of lattice A is given by 



We use a nested lattice structure as in [19], where A c denotes the coarse lattice and Af denotes the fine 
lattice and we have A c C Af. Both transmitters use the same coarse and fine lattices for coding. We consider 
lattices such that G(A C ) rs ^ and G(Af) f» whose existence is shown in [19]. In nested lattice coding, 
the codewords are the lattice points of the fine lattice that are in the fundamental Voronoi region of the 
coarse lattice. Moreover, we choose the coarse lattice (i.e., the shaping lattice) such that cr 2 (A c ) = P to 
satisfy the power constraint. The fine lattice is chosen to be good for channel coding, i.e., it achieves the 
Poltyrev exponent [19]. 

We use a block Markov coding structure, that is the messages are coded into B blocks, and are transmitted 
over B + 1 channel blocks. The relay forwards the information relating to the messages from each block 
over the next channel block. The relay is kept silent in the first channel block, while the transmitters are 
silent in the last one. The receivers decode the messages from the transmitters and the relay right after each 
block. Since there is no coherent combining, transmitters send only new messages at each channel block, 
thus sequential decoding with a window size of one is sufficient. We explain the coding scheme for two 
consecutive channel blocks dropping the channel block index in the expressions. 

Each transmitter i maps its message Wi to a fine lattice point Vi G Af n V(A C ), i = 1,2. Each user 
employs a dither vector Ui which is independent of the dither vectors of the other users and of the messages 
and is uniformly distributed over V(A C ). We assume all the terminals in the network know the dither vectors. 



Q A (X) ^argmin||X-Q||. 



The mod operation is defined as 



X mod A = X - Q A {X). 




while the normalized second moment is defined as 
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Now the transmitted codeword from transmitter i is given by X; L = (Vj — Ui) mod A c . It can be shown 
that Xi is also uniform over V(A C ). 

At the end of each block, we want the relay to decode V = (V\ + V 2 ) mod A c instead of decoding both 
messages. Following [22] (with proper scaling to take care of the channel gain 7), it is possible to show 
that V can be decoded at the relay if 

R < - log 2 |A, n V(A C )| < I log (\ + 7 2 p) . (73) 

Then in the next channel block, while the transmitters send new information, the relay terminal broadcasts 
the index of V to both receivers. The relay uses rate-splitting [27], and transmits each part of the V index 
using a single-user random code. Let R± and R2 be the rates of the two codes the relay uses, with power 
allocation 5 and P — 5, respectively. Each receiver applies successive decoding; the codes from the relay 
terminal are decoded using a single-user typicality decoder, while the signal from the transmitter is decoded 
by a Euclidean lattice decoder. Successful decoding is possible if 

fli < ilog(l + r/ 2 £), 

^4 log ( 1 + r^) and 

1 / 7] 2 (P-5) 

where R\ + R2 = R. This is equivalent to having 

R < ji log (1 + rfP) , ~ log (1 + P) , i log (1 + (1 + r?)P) \ . 
Combining this with d73l . we obtain the rate constraint given in the theorem. 

Appendix VIII 
Proof of Proposition 16. II 

The proof follows from [15], and here we only briefly sketch the main arguments. The first step is to notice 
that the capacity region for the Gaussian channel d23l with power constraints (|24l) is given by the region 
© where the further constraints E[Xj] = Y^q=i P{l)^[-^j\Q = q] < Pj f° r 3 = 1)2,3 are imposed on the 
input distribution. One then fixes a given value of Q = q and powers Pj(q) = E[Xj\Q = q] and shows that, 
for any given input distribution (recall ©) (Ui q , U 2q , X\ q , X 2q , X 3q )~ p(x±,ui\q)p(x 2 , U2\q)p(x 3 \u 1 ,u 2 , q) 



February 18, 2009 



DRAFT 



40 



satisfying the power constraints E[Xj\Q = q] < Pj(q), one can find a set of jointly Gaussian variables 
{Ufqi U 2q , X± t ', X% q , X® '' ) such that: (i) the joint distribution can be factorized as in ((6]); (ii) the power 
constraints are satisfied; and (Hi) all the mutual information terms in © are larger than or equal to the 
corresponding values obtained with the original distribution. 

Notice that, as discussed in [15], the existence of such a tuple of Gaussian variables does not follow 
immediately from the conditional maximum entropy theorem. In fact, variables satisfying given Markov 
constraints (as in ([6])) might have a covariance matrix for which a joint Gaussian distribution with the same 
Markov constraints cannot be constructed. However, in our case, similarly to [15], jointly Gaussian random 
variables satisfying can be found as discussed below. 

First, for a given tuple (Ui q , U 2q , Xi q , X 2q , X 3q ), construct another tuple (Vi q , V 2g , X\ q , X 2q , X 3q ) with 
V\q = E[X\\Ui) - E[Xi] and V 2q = E[X 2 \U 2 ] - E[X 2 ]. It can be readily seen that with this new input 
distribution, all the mutual information terms in ([5]) are larger than or equal to the corresponding values for 
(Ui q , U 2q , X\ q , X 2q , X 3q ) (property (Hi) above). In fact, the non-conditional term d5dl ) is unchanged (due to 
the fact the joint distribution of Xi q , X 2q , X 3q has not changed), while the remaining terms, which contain 
conditioning with respect to V\ q , V 2q , are increased, since V\ q , V 2q are deterministic functions of U\ q , U 2q , re- 
spectively [15]. Now, define (uf q , U$ q , Xf q , X% q , Xf ? ) as the zero-mean jointly Gaussian tuple with the same 
covariance matrix as (Vi q , V 2q ,Xi q , X 2q: X 3q ). For this distribution the power constraints are clearly satisfied 
(property (ii) above). Moreover, we can show that this jointly Gaussian distribution factorizes as (O (property 
(/)). This follows similarly to Lemma 6 in [15]. In fact, Xi q ,X 2q are independent and, since Ui q ,U 2q 
are independent, so are V\ q ,V 2q ; as a consequence, the covariance matrix of (Uf , U^q, Xf , X^q, X^ q ) 
is fully defined by the subco variance matrices of (llf ,X?,x9) and (U^ , X% , X^ ). Now, since both 
submatrices satisfy the conditions of Lemma 5 of [15], we can conclude that the jointly Gaussian vector 
(Ui q , U^qjXfq, X^q, X^ q ) satisfies the desired Markov conditions ([6]). It finally follows from the conditional 
maximum entropy theorem that (Uf , U^q, Xf q , X^, X% ) is optimal for a given Q = q. The final step is 
to use the concavity of the mutual informations at hand with respect to the powers Pj(q) to see that time 
sharing (i.e., the variable Q with non-singleton domain) is not necessary. 

From the arguments sketched above, we conclude that a zero-mean jointly Gaussian distribution is optimal, 
thus we can write 

X 3 = V^PjU j + ^-p-S j (74) 
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for j = 1, 2 and 

x 3 = y^^c/i + y^f/ 2 + x/^S 3 (75) 

with < ot\, a.2, a' 3 , a 3 < 1, a' 3 + a 3 < 1, a\ = 1 — ai, «2 = 1 — ®-2 and a 3 = 1 — a' 3 — a 3 , and where 
Ui and 5*j are independent zero-mean and unit-variance Gaussian random variables. The capacity region (f5]) 
then reads 

R 3 < ilog(l + a 3 P 3 ), (76) 
Ri + Rs < \ log(l + aiPj + a 3 P 3 + ( V^Pl" + y«^) 2 )), (77) 
i?2 + i?3 < \ log(l + 02^2 + a 3 P 3 + + ^4^) 2 )) and (78) 

i?i + i? 2 + i? 3 < ~ log(l + Pi + P 2 + P 3 + 2 y Ja 1 P 1 a' 3 P 3 + 2yJa 2 P 2 a^P 3 ), (79) 
where each term is clearly seen to be maximized by ot\ = 1 and 02 = 1. This concludes the proof. 
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